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1. INTRODUCTION 


1.1 Overview 


W9960CF is a single chip multi-protocol high performance video CODEC offered by Winbond 
Electronics Corp. for video compression and decompression applications such as video- 
conference. 


W9960CF is composed of a high performance RISC processor core (VRISC), function blocks for 
video encoding/decoding and a downloadable program memory such that the system can be run- 
time configured for a variety of video applications. The function blocks in W9960CF are 
computing engines for: Discrete Cosine Transform (DCT), Inverse Discrete Cosine Transform 
(IDCT), Motion Estimation (ME), Motion Compensation (MC), Quantization (Q), Dequantization 
(Q") and VLE/VLD (Variable Length Encoding/Decoding) algorithms. Using these function 
blocks, the firmware can direct the VRISC processor to perform ITU-T H.261/ H.263 
simultaneous video bitstream encoding/decoding. 


Although W9960CF is designed for multiple standard video encoding and decoding, it is 
particularly optimized for H.261/H.263 video-conference application. W9960CF supports all 
video resolutions as specified in the H.261/H.263 standards, including SQCIF, QCIF and CIF 
at high video frame rate. For CIF resolution in H.261 and QCIF resolution in H.263, 
specifically, W9960CF delivers excellent encoding/decoding performance. 

Implementing most the advanced video encoding options, W9960CF enables lowest video 

data rate such that maximum frame rate can be achieved through ISDN, PSTN and Internet 
networks. The advanced encoding options supported are: Unrestricted Motion Vector Mode, and 
PB-frame Mode. With half-pixel search for motion estimation function, W9960CF further 
enhances video quality with even lowered video bitstream data rate. 


W9960CF is also designed with a most cost-effective PC based video-conference solution in 
mind. It has a digital live video interface for glueless support a of major video decoders and 
coefficient-programmable filter circuitry for input video enhancement. For video displaying, it 
supports PCI master mode capability to access video frame buffer of PCl-based graphics 
adapters. W9960CF also provides an interface for audio modules. The audio modules connected 
can be a CODEC for G.711/G.722/G.723/G.728 standards, or a PCM CODEC for audio raw 
data. W9960CF uses ordinary FPM or EDO DRAM as working storage. Figure shows application 
block diagram. 


Video 
Reed aa ieee 

i D d 
tego —_— Video CODEC DRAM 


Digital Camera 
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1.2 Features 


e Built-in RISC processor core and 4.5Kx22 bits program memory 

e Supports ITU-T H.263 and H.261 simultaneous video encoding and decoding 
e Supports SQCIF, QCIF and CIF video resolutions 

e Supports H.263 Annex D Unrestricted Motion Vector mode 

e Supports H.263 Annex G PB-frames Mode 

e Supports both integer search and half-pixel search motion estimation 

e Built-in BCH error correction and framing error detection circuitry 

e Supports YUV 4:2:2 video input interface for video camera 

e Built-in filter circuit with programmable coefficients for input video enhancement 
e Selectable video output formats including YUV 4:2:2 and RGB 5:6:5 

e Supports PCI master mode to access graphics adapters for video displaying 
e Supports panning and zooming over video input 

e Provides audio connection to external audio DSP modules 

e Uses conventional FPM and EDO DRAM 

e No SRAM required 

e Optimized for 3.3 volts operation 

¢ 0.5um CMOS technology 

¢ 208-pin PQFP package 
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2. PIN DESCRIPTION 


2.1 Pin Definition 


| Piname | Pino. | type | Function = 


PCI BUS (50 pins) 


AD31-ADO 204-205,4-9, Address and Data are multiplexed on the same PCI pins. The 
16-23, 42-49, address phase is the clock cycle in which FRAME# is 
57-60, 65-68 asserted. During data phase AD7-ADO contain the least 
significant byte (Isb) and AD31-AD24 contain the most 
C/BE3-C/BEO |14,28,37,56 Bus Command and Byte Enables are multiplexed on the 
same PCI pins. During the address phase of a transaction, 
C/BE3#-C/BE0# define the bus command. During the data 
phase C/BE3#-C/BE0O# are used as Byte Enable 


pak —ti‘édgeStsté‘i:Ct Parity is even parity across AD31-ADO and C/BE3#-C/BEO# 
FRAME# 2 __|_o_ FRAME# is asserted to indicate a bus transaction is beginning 


TRDY# Target Ready indicates the ability of target agent to complete 
the current data phase of the transaction 

IRDY# Initiator Ready indicates the ability of bus master to complete 
the current data phase of the transaction. 


INTA# }2¢__o__ Interrupt A is used to request an interrupt 


STOP# Stop indicates the current target is requesting the master to 
stop the current transaction. 


DEVSEL# Device Select, indicates the driving device has decoded its 
address as the target of the current access 

IDSEL Initialization Device Select is used as chip select during 
configuration read and write transactions. 

PERR# -—_{— Parity Error is for the reporting of data parity errors 
System Error is for reporting address parity errors, or any 
other system error where the result will be catastrophic. 


Request indicates to the arbiter that W9960 desires use of the 
bus 


Fan indicates that W9960 access to the bus has been 
granted 


PCI Clock 
PCI Reset 


Winbond Confidential 7 June 1997 


W9960CF 


Winbond 


Electronics Corp. 


GPIO BUS (4 pins 


GPIO3- 153, 160-162 fo feonnec to GPIO bus to video decoder or coprocessor 
GPIOO 


| PinName | Pino. | type | Function = 


DRAM BUS (50 pins) 


MD31-MDO_|141-132,127- DRAM Data Bus 
118, 113-108, 
101-96 
OE1# -OE0# |74,70 Lo | DRAM Output Enable 


RAS 1# - 81,73 DRAM Row Address Strobes 
RASO# 


CAS 1# - 80,72 DRAM Column Address Strobes 
CASO# 


MA9-MAO _ {91-82 Lo | DRAM Address Bus 


WE1# -WEO# |75,71 DRAM Write Enable; WE1# for MD31-MD16, 
and WE0# for MD15-MDO 


AUDIO BUS (5 pins ) 


prs igs 10 Receiver Frame Signal 
mes iigo— 10 ransmission Frame Signal 
pr —is0_ | |ransmission Data 
pp ist | Receiverbata 


SCLK clock of serial port 
VIDEO BUS ( 20 pins ) 


VD15-VDO 186-184, 179- Video Data Bus 
169, 164-163 
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| Piname | PinNo. | type | Function = 
ISA-Like BUS ( 30 pins ) 


196-197, 207, 8-bit Parallel Data Bus 
25-26, 54, 62- 


77-78, 93-94, 16-bit Parallel Address Bus 


103, 106, 115- 
116, 129-130, 


1O Read Control signal 
IO Write Control signal 


U_ {External Interrupt #1 


IU‘ |External Interrupt #2 


External Boot ROM Enable 


When BTEN¢# low active, firmware down-load by through ISA- 
Like Bus automatically after RST# inactive 


External Boot ROM Chip Select 
LOCK SOURCE ( 2 pins ) 
Internal Clock 

Video Clock 
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| PinName | PinNo. | Type] = Function = 
POWER (39 pins ) 


1,10,24,38,53, 3.3V DC Power supply 
61,76,92,105, 
114,128,142, 
157,165,180, 


195 


3,13,27,41,50, Ground 
55,64,79,95, 

102, 

107,117,131, 

145, 154, 159, 

168, 183, 198, 


VDD5V 52,104,156 | |s.ov DC Power suppl 


NC (8 pins ) 


ee 


Winbond Confidential 10 June 1997 


Winbond 


Electronics Corp. 


2.2 PinOut Diagram 


W9960CF 


s RA ss 
. AA RR - : 
p2vs Vv DDVMMMMMMMMMMV 5 pV MMMMMMMMMMy AA v MMMM Av 
p2s;c TR SRRDDDDDDDDDDDSppDDDDDDDDDDDS PPDDDDDDDS DPD 
5Rs oLDDFFENNS;7D3322222222SpRD2211111111S8P0p119876S80D 
vigskKRTSSCCB B1098765432| 110987654328 B10 1 Ry 
oP 3 10 98 76 5 
JER ERRRRRRRGRERERREE BERRRORRRRGRERERRRRERREREEE GRGRELRGRRRERERRRRERRRRREEE GRRRRGRERERRRRREEE GD 
1 1 1 
VDDI 5 3 1 =i VDD5V 
SAADDR1@4 5 0 5 = SAADDR4 
VSsSsI =i VSSB 
GPlo2 FI MD5 
GPIO1 FI MD4 
GPIO0 FI MD3 
vbo FI MD2 
vDi FT MD1 
VDDB Ft MDO 
SAADDR14 4 VSSB 
SAADDR15 4 (1  SAADDR3 
VSSB (1  SAADDR2 
Vb2 VDDB 
vbD3 FI MAg 
vD4 FI MA8 
vD5 Ft MA7 
vbe FI MAG 
vb7 FT MAS 
vbs FT MA4 
vba FT MA3 
vDi0 & FI MA2 
vDi1 & FI MAI 
vbDi2 4 . = Mig 
VDDI FI RAS1# 
saione Video CODEC = CASte 
SAIOW# & VSsI 
VSSI HI  SAADDRi 
VD13 & (1 SAADDRO 
vDi4 VDDI 
vDi5 & FI WE1# 
HRESET# & FI (OE1# 
VRESET# FI — RASO# 
ACTive 4 = CASO# 
BTEN# 1  WEO# 
BTFLAG# OEO# 
EINTi# 4 CLK_MEM 
Dvalid — ADO 
CLK_Video & AD1 
VDDB AD2 
SADATAO-4 AD3 
SADATA1G] VSSB 
VSSB FI SADATA7 
INTA# HI SADATAG 
RST# & VDDB 
CLK & FIT AD4 
GNT# & FI ADS 
REQ# & FI AD6 
AD31 FI AD7 
AD30 I = C/BEO# 
VSSI VSSI 
SADATA2 4 SADATA5 
EINT2# = 1 2 2 3 3 4 4 VDDI 
5 5 0 5 0 5 0 5 
UM UU UU UU UU EU UU UU UU UU UU UU UU UU ddd 
VNVAAAAAAVNNVCLAAAAAAAAVSSVCF) TDSPSPCVNN VAAAAAAAAV NV 
DCSDDDDDDDccS/2DDDDDDDDDAAS/ RRRETEEA/ DCCSDDDDDDDDSCD 
D~S$222222D S$BS599221111DDDSBADDVORRRBD $11111198S D 
| B987654B BEE32109876!AAIlEMyYSPRR EB B543210 BO5 
3l TT 2Eyg#E##H# 1 Vv 
# AA ## L # 
34 # 
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3. FUNCTIONAL DESCRIPTION 


3.1 W9960CF Architecture 


W9960CF 


W9960CF is composed of a high performance RISC processor core (VRISC), function blocks for 
video encoding/decoding and a downloadable program memory such that the system can be run- 
time configured for a variety of video applications. The figure is a block diagram for W9960CF. 


PCI_BUS 


ISDN 
Modem 


Graphics 
Adapter 


QO 
(2) 
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RISC 


PCI Memory 
DMA Controller 


PCI Bus 
Interface 


Motion 
Estimation 


DCT/IDCT 
/| 


VLE/VLD/BCH 


DRAM 
Controller 


Video Pre/post 
Processor 


Filter 


Audio Port 
Controller 


ISA-Like 


Timer/GPIO 
Interrupt Controller 


4 
ine) 


DRAM 
Video 
Decoder Camera 
Speaker 
DSP 
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3.2 PCI Interface 


W9960CF provides PCI master/slave Interface. In the master mode, it supports fast DMA data 
transfer for video/audio bitstream, picture direct draw to graphic display device and VRISC 
firmware download. In the slave mode, there are two Base Address Registers for W9960CF 
internal registers and external DRAM accessing by the host processor. All data accessing, except 
for configuration registers, should be word (2-byte) or double word (4-byte) read/write operations. 


PCI Interface 


() 


Internal CPU Bus 


Baul ea Internal XDMA Bus 


External PCI Bus 


Device ID Vendor ID 


er 


Base Address Register 0 (BARO) 


14H Base Address Register 1 (BAR1) 


18H - 38H Reserved 
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3.3 VRISC 


VRISC contains a 4-staged pipeline: Instruction Fetch (IF), Instruction Decoding (DEC), 
Operand Execution (EXE), and Result Write-Back (WB), and a 16-bit ALU with Integer 
Multiplication and Division. It uses a built-in 1K-byte Data RAM and a 4.5Kx22 bit Instruction 
RAM to which the firmware program can be downloaded from the host processor. This VRISC 
contains a tri-port (2-read/1-write) 32x16 bit Register File and 32 interrupt vectors. 


RISC Bus 


Program 
int! ——>| Interrupt Counter 
. — 
int8 ——>| control control 


Instruction 


Fetch Instruction — 
RAM contro 


Decoding Inst. Reg. 


Decoding J 


z E . Inst. Reg. 


| r| Arbiter 
Write WB Inst. Reg. I L__] 
Back WB Reg 


Address Spaces 


VRISC has two address spaces: the Program Address Space and the Data Memory Address 
Space. The following figure illustrates the layout of the booting address, interrupt vector address 
space in the main program address space. Program always start from booting address ( OOOOH ) 
after the host enables VRISC. Address 0001H through 001FH stores Interrupt vector for 
Interrupt service routines. Figure 2 illustrates the layout of the registers, Data Memory (DM), 
external DRAM address space in the main data memory address space. Address 000000H 
through 0001FFH is for VRISC Register File referencing, address 000200H through 0003FFH is 
for internal DM memory accessing, and address 000400H through 1FFFFFH is for external 
DRAM accessing. VRISC cannot access the external DRAM from address O00000H through 
OO0O3FFH since this address space are reserved for internal registers and DM memory. 
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0000H 
Program 
Address p001n 
Interrupt 
Vector 
Address 
Space 
001FH 


Program Addressing Space 


Enai 000000H 
ngine 
Register 
Address 
Space 
ial2) 0001FFH 
DM 000200H 
Space 
emory 
Address 
000400H 
1FFFFF 
DRAM 
Address 
Space 
1FFFFFH 
Data Memory Addressing Space 
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Addressing Mapping 


VRISC has a 5-bit segment register, DMSA. The 21-bit memory address is composed of the 5- 
bit content of the DMSA and the content of one of the 32 16-bit general registers as specified in 
the Load and Store instruction. The segment register provides a solution to extend the address 
space from 64K to 2M bytes. DMSA is a write-only register which can be programmed by the 
SEGS imm65 instruction (see VRISC Instruction Set). 


Data Memory Address Mapping 


General Registers (R0O..R31) 


VRISC has 32 16-bit general registers to provide the resource for all computation. They are 
numbered as RO through R31. RO delivers zero when referenced as a source operand. When RO 
is used as destination, the result is discarded. 


Interrupt Handling 


0000h main program starting address 


oooth_| MERDY 
0002h FRDY FILTER FILTER ready interrupt 
0003h TendINT DCT/IDCT IDCT ready interrupt 
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en) pe 


0004h RISCINT DCT/IDCT DCT ready interrupt 
E 


000s XDMA 
oo06h Ext. Interrupt 
0007 VLE 
oo08h TIMER 
000sh TIMER 


000A TIMER 
0008 TIMER 
oo0ch HOST 
0000 VLPIO 
000E VLPIO 
000F VLPIO 
0010h FILTER 
001th ME 
0012h ME 
0013h FILTER 
001 4h DCTIDCT 
0015h DCTIDCT 
0016h DCTIDCT 


oo1gh | drequ | VPRE 
oo1gh_| dreqy | PRE 
00th vPOsT 
0018 vPOsT 
001Ch vPOsT 


001Dh DREQ_ENCF VLPIO DMA TC int. for Encoding bitstream 


0017h VPRE DMA TC int. for V block of Capture-in 


001Eh DREQ_DECF VLPIO DMA TC int. for Decoding bitstream 
001Fh DREQ_IPTF VLPIO DMA TC int. for bitstream from PCI FIFO 
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VRISC provides 32 vectors on the top of the VRISC program space for jumping to interrupt 
service routines. These 32 interrupt jump vectors consist of 15 engine interrupts (0001h~O00Fh) 
and 16 DMA TC interrupts (0010h~001Fh). The 0000h vector jumps to the main program 
starting address. The following example describes the interrupt handling flow: 


e VRISC Start : VRISC Program Counter (PC) always starts from address 0000h after the 
host processor enables VRISC. The content of address 0000h should be a JMP 
instruction to jump to the entry point of the main program. (path1) 


e Engine Interrupts : When VRISC receives an engine interrupt, Program Counter assumes 
the value of the interrupt vector address according to the interrupt happening (path2). 
The processor then jumps to the interrupt service routine (path3). When an interrupt is 
happening, VRISC disables other interrupt inputs and stores the current fetch address at 
IF stage, current instruction at DEC stage, and current status at EXE stage to shadow 
registers. They will be restored after VRISC finishes the interrupt service. 


e Interrupt Service Routine : The IVEC (Interrupt Vector) register has to be read out to 
generate an acknowledge signal to the interrupting engine. An El instruction should be 
used at the bottom of the service routine to enable other interrupt inputs again. The last 
instruction of a service routine is the RET instruction, which restores all status from the 
shadow registers and then the VRISC resumes the original program flow (path4). 
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RISC start 


Jmp Int1 
Jmp Int2 


Jmp DMAint13 
Jmp DMAint14 
Jmp DMAint15 


VRISC Shadow Registers 


VRISC Shadow Registers are used to store the CPU status when VRISC runs into a Call 
instruction or an Interrupt. All registers in this group can only be accessed by VRISC. Host 
cannot access registers of this group. 


008H RW 
00BH RIW | | 


high-order bits of Instruction Return Register 
00CH TEMPRES_H PR ntghorder bits of TEMPRES Register 


OODH TEMPRES _L  _Niowedrder bits of TEMPRES Register 


Winbond Confidential 19 June 1997 


W9960CF 


Winbond 


Electronics Corp. 


3.4 Frame Memory DMA Controller ( FDMA ) 


There are 16 FDMA channels which are used for direct memory accessing between frame 
memory ( DRAM ) and the engines. The transfer type can be either Demand or Block Mode. 
Each transfer type can be either Linear or Blocking Addressing. The programmer needs to 
specify the picture size with PH/PW register, transfer size with the EH/EW register, the picture 
starting point with FMSA register and engine starting address with ESP register. 


FDMA generates a TC signal to interrupt VRISC when DMA service is completed. When TC 
interrupts VRISC, the DMASK bit is cleared automatically. The FDMA requests are queued 
when FDMA is busy. The software FDMA has the highest priority. 


FDMA channel assignment is as follows: 


Channel| DMA request | Engine |direction| u | LIN |dma|Rw| Description 
| o | Forea | Fiter | me |u| | | w| blockintormc 
me | mse |u| | | w | block infor Search Window of ME | 
| 2 |preacexk| we | mse | | | | w | blockintor Curent Block of ME 
| 3 | roorea | iter | e>m | | | | | blockouttorBy-pass Fier 
| 4 | orapwan |octict| m>e | | | | w| blockinforoct 
| 5 |prainctr p|octipcr| e>m | |B | block out for Decoder Re-Construct 
| 6 |prapcrr e|pctipcr| e>m | | F_| block out for Encoder Re-Construct 

| dteqv | video in| E>m |_| | | Chrom Crof Video Capture | 
| 8 | drequ | videon| e>m | | |dmd] p | Chrom Cb of Video Capture 


| 10 | dregy [video out] m>e | | dma] w | tumyofDisplay 
| 11 | dred [video out] m>e | | lama] w | chrom bof Display 
| 12 |  dreqv [video out] m>e | | lama] w | Chromcrof Display 
| 13 |oreqencr| pio | m>e | [un |dma| w | Encoder bitstream out 
vse | [un |ama| w | Decoder bitstream from FM to VLD| 


15 DREQ_IPTF E>M LIN | dmd Incoming decoding bitstream from 
PIO input FIFO to FM 


fo | aeqy [video in| e>m | | [dmal a | tumvofvideo Capture | 
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Block Diagram 


DMA_BUS 
3.4.1. BUS ARBITRATION 


PCI bridge has the highest priority of bus access, VRISC has second priority and FDMA has the 
lowest priority. PCl issues CBR_(CPU Bus Request) to get memory bus when VRISC is working. 
Also, VRISC can interrupt FDMA by issuing a MBR_(Memory Bus Request) signal to get access 
to memory bus. 


3.4.2 FDMA TRANSFER TYPE 


The FDMA do Unrestricted Mode when the start point of picture is out of picture boundary. 
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BLOCK Addressing Linear Addressing 


7 = Be 


Filter ge PIO 
DCT/IDCT ~ 


3.4.3 FDMA PROGRAMMING 
1. FDMA has two addressing modes: Linear and Block Addressing; and two transfer modes: 
Block and Demand Mode transfer. 


2. Unrestricted mode is only supported by chanenlO through channel3. So the PSP ( 
Picture Start Point ) can be of negative value. 


3. Block addressing mode : The following is the relation between DRAM address and Engine 
address 


FMSA = 100) 

finit = 1023 
102 
103 
103 


DRAM Picture Engine DRAM Picture Engine 
Address _Y_X x Address _Y_ X Xx 
23 3 


1036 
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4. Program FDMA control registers. 
a. MODE register : LIN=0 ( block addressing ), PP(2:0), Eph(3:0), Epw(3:0), Dmd(0), RW_ 
b. Transfer size register : EW( EPw(3:0)), EH( Eph(3:0)), 
transfer size = ( EW+1) * (EH +1) =( 341) *(34+1)=16 
c. Picture size register : PW( PP(2:0)), PH( PP(2:0)) 
d. Frame memory starting address : FMSA = 1000 
e. Picture Start Point : PSPy = 2, PSPx = 3, 
f. Start to calculate finit=( PSPy * (PW+1) + PSPx ) + FMSA = (2 * (9+1) +3) +1000 = 1023 
g. Engine start point : ESP =( 1,1) 
h. Enable DMASK 
5. Linear addressing mode 


DRAM 
FMSA=100 


ENGINE 
finit=1015 


PSPy=0 
PSPx=15 


1115 


6. Program control registers. 
a. MODE register : LIN=1 (linear addressing ), Eph(3:0), Epw(3:0), Dmd(1), RW_ 
b. Transfer size register : EW( EPw(3:0)), EH( Eph(3:0)), 
transfer size = EH x 2**9 + (EW + 1) =0 x 2* *9 + (100 + 1) = 101 
c. Frame memory starting address : FMSA = 1000 
d. Picture start point : PSPy = 0, PSPx = 15, 
e. Start to calculate finit=(PSPy*( PW+1) + PSPx) + FMSA = (0 * (9+1)+15)+1000 =1015 
f. Engine start point : ESP = (0, 0 ) 
g. Enable DMASK 


7. In demand mode, the DMA service will pause if DMA request becomes inactive, and the 
service will continue if DMA request is active again. 


8. FDMA still transfers two sets of data after DMA request becomes inactive in demand mode. 
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3.4.4 FDMA ADDRESSING REGISTERS 


Each channel has 4 addressing registers: Engine Start Point Register, Picture Start Point 
Register of X_axis, Picture Start Point Register of Y_axis and Frame Memory Start Address 
Register. 


Picture 


FMSA 


FMSA+[(PW+1)(PH+1)-1] 


FDMA Registers List 


PSPx0-15 
| R | Frameinitial vauel 


PSPy0-15 Picture Start Point of Y-axial 
| oR Frame initial value-H 
0070H - 007FH,/ 01COH - 01FCH FMSAO-15 Frame Memory Start Address 
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RISC Address/PCl Offset Address Read/Write 
0030H/O0COH DMA_Index FDMA Index Register 
0031H/00C4H DMSK FDMA Mask Register 


0032H/00C8H SDMA Software FDMA 
0033H/00CCH DSTS FDMA Status Register 


0034H/00D0H | ors | rR _| TC Status Register 
0035H/00D4H TCMSK TC Mask Register 


Reserved 


0037H/00DCH DMA_HW Height/Width Register 
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3.5 External Memory DMA Controller ( XDMA ) 


There are 8 channels in XDMA which is used for direct memory accessing between PCI bus and 
internal engines. While XDMA has DMA request, PCI interface will enter PCI master mode to 
issue master cycles in PCI bus. The DMA transfer type supports both Demand Mode and Block 
Mode, each with Linear Addressing or Blocking Addressing. Channel #0 and #1 are used to 
transfer the remote and local picture out to system memory or graphic display device directly. 
Channel #2 and #3 are used to transfer audio bitstream to external DSP coprocessor. Channel 
#4 and #5 are used for video bitstream encoding and decoding. Channel #6 is used for firmware 
downloading. Channel #7 is for verifying half-pixel search window memory. Channel #7should 
not be activated in normal operation. The programming sequence is like the following: 


1. Setting DMA start address : programming XMSA register ( External Memory Start 
Address register) with 32-bit access. The address should be double-word (4 bytes) 
aligned. 


2. Setting DMA transfer mode : each channel has its own transfer mode register. The 
register is accessed by two steps: set index value into XDMA Index register and then 
write the data value into Height/Width register to set the Mode register (Such as 
Linear/Block Addressing, Demand/Block Transfer, direction, and size index) 


3. Setting DMA transfer size : the transfer size is defined by EH and EW registers. For 
Block Addressing Mode, the transfer size is (EW+1)x(EH+1)x4 bytes; for Linear 
Addressing Mode, the size is generated by concatenating EH and EW registers, i.e. the 
transfer size is ({EH[8:0], EW[10:2]}+1)x4 bytes. XDMA provides 8 sets of EH and EW 
registers as indicated by each XDMA channel from the Mode register setting. The 
programming method of EH and EW register is the same as in programming Mode 
registers. 


4. Setting frame size : XDMA , in Block Addressing Mode, refers to frame size of external 
memory to generate the block address. XDMA provides 8 sets of PH and PW registers 
to define the frame size (PW+1)x(PH+1)x4. 


5. Setting XTC Mask register: XTC Mask register indicates XDMA assert TC interrupt or 
not. 


6. Setting XSDMA or XDMSK register: Host and VRISC can program XSDMA register to 
trigger software DMA operation. XDMSK register is for enabling hardware triggered 
DMA operations. Hardware engines can issue DMA requests to XDMA to trigger DMA 
when the corresponding bits in the XDMASK register are set. 


7. Read XDTS register: while DMA operation has been completed and XDMA issues a TC 
interrupt to host or VRISC, the interrupt service routine has to read the XDTS register 
(TC status of XDMA) to clear the TC flag, so that the XDMA can continue with the next 
DMA operation. 
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XRadrq VPOST E>PCl Pal dmd |Remote Video out 
1 DRQVPOSTX |_ VPOST E > PCI dmd |Local Video out 
XDREQ_RX Audio E>PCl dmd |Bit-stream for Audio out 


XDREQ TX Audio PCI>E | W_ | LIN | dmd |Bit-stream for Audio in 


2 

3 
| 4 |xoreqopte| pio e>rci| R | Lin |dmd|video Encoding Bitstream output__| 
| 5 |xoreqipre| pio | pci>e| w | uN | dd video Bitstream for Remote Data in _| 


CPU_PM PCI>E | W dmd |Firmware download to PM 


Le CPU 
| 7 |oackisw{] ve e>pcil a | - | * |pait pixet Search window (for Testing | 


XDMAC Registers 


R/W 
R/W 


003BH/00ECH XDSTS | Rw XDMA Status Register 
003CH/O0FOH XDTS | ln TC Status of XDMA Register 
003DH/00F4H XTCMSK | Rw 


TC Mask of XDMA Register 


003EH/O0FSH || Reserved 


003FH/00FCH XDMA_HW XDMA Height/Width Register 
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3.6 DRAM Memory Interface 


W9960CF provides a 32-bit DRAM data bus for DMA data transfer and VRISC access. It 

supports the control timing for Fast Page Mode or EDO DRAMs. The DRAM address space has 

three types of configurations: 1M, 2M, and 4M bytes. For 1M and 2M-byte space, the DRAM 

devices must be two or four 256Kx16 DRAM devices. For 4M-byte space, it must be two 1Mx16. 
1ST MBYTE 


2ND MBYTE 
| 


"Tete el ste 


D[15:0] 


D[15:0] 
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Timing 


@ Normal Case 


CHET TE TT 
SE Oe Le 
Bee 


Read from PD bra | 
— 


OE# 


PL 
=| 
aa 
| | 
pesca 
| 
Write to DR4 St 
MD[31:0, 


WE# 
mrdy_ 
edowr. 

mrdy_ 

(Fast mode 
edowr_ 


@ At EGF4 Mode (EDO-60, 40MHz 


ALU UE | UY 
HL We ey 
rUckL lece | lie 
rlcleL leece | lac 
TIceL_ lee | ict 
Tce L Jee | Le 
rt Mee | TA 
a 
A Tat A 


1X Column ) 


umn 


ol 


1X Column | 


RAM 


Write to DRA 


Read from D 


CLK 
MCS# 
RAS# 
CAS# 
ADD[9+0 
WE# 
MD[31:0' 
mrdy_ 
edowr 
OE# 
MD[31:0' 
mrdy_ 
edowr_ 


June 1997 
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3.7 INTERRUPT/TRIGGER Controller 


This controller provides 10 different Interrupts to VRISC and _ 5 triggering signals for VRISC to 
trigger engines. 


Trigger #0 is used for Video Capture triggering. Trigger #1 is used to start ME motion estimation 
search. Trigger #2 is used to start FILTER block operation. Trigger #3 is used to trigger IDCT to 
start decoding inverse discrete cosine transform operation. Trigger #4 is used to trigger DCT to 

start encoding forward discrete cosine transform. 


Interrupt #5 is XDMA Terminal Count Interrupt. Interrupt #6 is the interrupt coming from ISA-like 
external interface. Interrupt #7 indicates the VLE FIFO full while T-coeff. encoding. Interrupt #8 
and #9 are TIMER TR (Temporal Reference) Interrupts for decoding and encoding. Interrupt #10 
and #11 are the TIMER time out interrupts. Interrupt #12 is used for host to interrupt VRISC. 
Interrupt #13 indicates VLD operation is completed after VLD command register is triggered. 
Interrupt #14 indicates the PIO BCH code is not aligned with frame. Interrupt #15 indicates a 
Run-Level Block Error in VLD decoding. 


All interrupts can be enabled or disabled as specified by the mask bits of the IMSK register. It will 
response which channel is active on ISR register and generate an INT to VRISC. When VRISC 
enters an interrupt service routine, it has to read out the IVEC register to have Interrupt 
Controller assert INTA to clear interrupt status. 


0 [tric] [Capture Trigger| VideoPre |Video Capture trigger | 
Trigger Filter 
Trigger !DCT 
sia] int || xo | XDMATC Interrupt 
6 linta| —extint | tS Atike | ISAExteral Interupt | 
7 imal weir | | werco | WeEFIFO Fu inerupt | 
3 imal orainr | | TIMER | Temp. Ref, Interrupt (Decoder) —_—_| 
9 |p| erainr | | TiMER | Temp. Ref, Interrupt (Encoder) | 
10 [ira] touro | | riven | Timer dinterupt 
1 |inta| tours | | timer | timer tinterupt 
12 [ral pont | | Host | Hostintoruptrisc | 
13 [rp] wroy nt | tata | PIO | vidisoer 
14 _[ivtR|UFRAME INT|UFRAME INTAL PIO | Frame Unock Interrupt | 
VLD Run Level Block Error 
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Block Diagram 


INTG Registers 


001CH/0070H TMOD Trigger Mode Register 


Software Trigger Register 


001DH/0074H STG 
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3.8 X_INTERRUPT Controller ( XINTC ) 


XINTC provides 16 channels interrupt trigger source and generates INTA# to PCI_BUS. 
Channel #0~#7 are used for XDMA requests. Channel #8~#11 are reserved for VRISC to issue 
interrupts to host. Channel #12 is for the ISA-like external Interrupt. Channel#13~#14 are the TC 
interrupts for XDMA and FDMA. Channel#15 is the interrupt from INTC controller. All channels 
are maskable by XMSK register. Host has to read the XSTS register to identify interrupt source 
when receiving a W9960CF issued interrupt. Host issues interrupt to VRISC by programming 
PCI_INT register, which will generate a interrupt trigger pulse for VRISC to enter interrupt service 
routine. 


xiINT IN| 


2} to xdreqO dreq of XDMAC 
ent 
se 
sed 
tt 
| xdreqs: | 
-— | xdreq6 
lL seciraar | 
ee 


ae ed ree eee 

| to | it | Reserved for VRISC) 

poi ft | Reserved for VAIO) 
13 tc of XDMAC 


XINTC Registers 


RISC Address/PClI Offset Address Read/Write 
0006H/0018H XMSK X_Interrupt Mask Register 


0007H/001CH XSTS oR X_Status Register 
PCI_INT PCI_INT Command 
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3.9 GPIO ( General Purpose Input/Ouput) Port 
W9960CF provides 4 pins as a GPIO port. These four pins are programmable to be input or 
output. GPIOO and GPIO1 are two open drain IO pads and pull-high resistors are necessary in 


application circuits. GPIO2 and GPIO3 are tri-state |O pads. GPIO port is used to connect to 
external devices such as analog video decoder and/or audio coprocessor. 


wr_(CPU Bus) |_| L_| L| L_| 


rd_(CPU Busy o_o 
GPIO[3:2] 7X9 <> He@$ <9 
GPIO[1:0] 


3.10 TIMER 


W9960CF provides a programmable pre-scale counter (PSR register) for different Video Clock 
input and three period registers (TPO~TP2) for period setting of the temporal reference, time 
counter and DRAM reference. W9960 also provides two TR reference counters for decoding and 
encoding (DTR and ETR registers). Here is the programming example: 


if the video clock (CLK_Video pin) is 13.5MHz, set: 
RC2~RCO to 010b, 


P3~PO0 to 0011b 
TPO= 6DF8h (29.9701 6frame/sec), 83D5h (25.00 frame/sec), 


TP2=000Dh; 


If the Video clock is 27MHz, set: 
RC2~RCO0= 010b, 


P3~P0=0100b, 
TPO= 6DF8h (29.97016frame/sec), 83D5h (25.00 frame/sec), 


TP2=000Dh. 


The frequency calculation statement is as: 
Fre. = VCLK / ( (2**“*” )*(TP+1) ) 


where VCLK is input frequency of CLK_Video pin, N is the value of RC[2:0], and TP is the 
period setting of TPO, TP1, or TP2. 
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Block Diagram 


Pre-Scaler 
(2~/2 )'° 


Counter Counter 

(8 stages) (8 stages) 
TP 
register 


16 


TOUT Interrupt 


TIMER Registers 


crate on naoe | Nne [eee] 
P—mniom 
Te imeem 
Tames 
ee 


013H / 04CH | OTR 


Timer Period Register #1 
Timer Period Register #2 
Temporal Reference of Encode Register 


SR 
TRO 
TRI 

TR 
TPO 
TP1 
TP2 
ETR 
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3.11 Video Pre/Post Processing Engine 


The functions of Video Preprocessing engine are capturing, cropping, zooming video in 4CIF, 
CIF, QCIF, Sub-QCIF resolution. Actually, user can command this engine to capture image of 
any size within 704 x 576(PAL) or 704x480(NTSC). Video Preprocessing Engine also provides 
two programmable decimation horizontal filters for luminance data and chrominance data to 
smooth the captured image. 


Video Postprocessing Engine supports both RGB 5:6:5 and YUV 4:2:2 for both local and remote 
video output. RGB format is for normal VGA card without color space conversion. YUV format is 
for those video cards with color space conversion and scaling support. 


3.11.1 VIDEO PREPROCESSOR (VPRE) 


The Video PreProcessor (VPRE) transfers the captured video data to frame buffer DRAM 
through horizontal filters, horizontal subsampler and FIFO for DMA data transfer. The size of the 
Y FIFO is 44x4 bytes. U FIFO and V FIFO is 22x4 bytes each. The block diagram is shown 
below: 


D_Bus 


FIFO 44x32bits —-Y 


YUV422 
signal 
from 
Video Decoder 


ScaleH ; 
(1, 1/2, 1/4) FIFO Control 


/OddScaleV Full 
d, 1/2, 1/5) (1, 3/4, 2/4, 1/4 ) 


Bt/Ph_ 


taps filter 


Cap_trig 


Video PreProcess Diagram 


The following is the register parameters used to configure for different formats: 


H V (HxV) 
a 
| CIF Z1fr4CIFP 
/4 1 


t| 1 | 


144 
144 
+ ft | ot | ros [sre | ace | 


ed 

ia IF 

ia CIF Zl fr 4CIF ,P 
ai 

| 12_| 


1 
1 
1 
1 
/2 
1 
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| | 704 | 28 | on | of | we | 1 | of | ase | one | cm | 
| fase | ass | on | on | 1 | ve | of | ase | oes | cr | 


| | ase | 144 | on | oft | ve | 1 | off | 176 | 14 | ace | 
|| 704 | 28 on | ot | ve | ve | of | re | | ace | 


3.11.2 VIDEO POSTPROCESSOR (VPOST) 


VPOST provides an option of 1/2 scaling-down for both local and remote picture. It also supports 
color space conversion for either YUV422 or RGB565 video output. VPOST provides FIFO for 
DMA data transfer. The size of Y FIFO is 16x4bytes. U FIFO and V FIFO is 8x4 bytes each. 


Oox4byte 

BUFFER 
Colour e 
Space 


Convert 


Video PostProcessor Diagram 
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VIDEO Register 


[ascrarearoronat sass | vane [esac | oe 


015AH/0694H LFP R/W Luminance filter parameters, Tap1 and 
Tap2 

01A6H/0698H CFP R/W Chrominance filter parameters, Tap1 
and Tap2.Luminance decimation filter 
parameter Taps. 


01A7H/069CH VPOST Mode | BIW _|VideoPostprocessor operation mode 
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3.12 Motion Estimation Engine 


W9960CF motion estimation (ME) engine implements full search matching algorithm (FSA), 
which is widely used thanks to its simplicity and regularity. In this algorithm, for each reference 
block in the current frame, the previous frame is searched within a neighborhood, i.e. search 
window, to find the most matched pixel block. 


DMA-request 
CONTRQ 
LOGIC DMA-ack 


Half-Pixel 


INT-request 


Trigger 


— > MV 
MAD,MA,MB,MX 


— 
Fave 


ME Registers 


0190H / 0640H MEMODE R/W Mode Register 


O191H / 064aH 
0192H / 0648H MADR Mean Absolute Difference Register 


RISC Address/PCl Offset Address Read/Write 


0193H / 064CH Mean Current Macro Block Register 
0194H / 0650H MAR Mean Average Difference Register 


0195H / 0654H | owxre | Rw | Mean First Search Register 


R R/W 
0196H / 0658H SRGION Search Region Register 
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3.13 FILTER Engine 
Filter Engine implements the loop filter function of H.261. It is a two-dimensional spatial filter as 


shown below. The Filter Engine also performs the interpolation function of half pixel prediction 
and PB frame prediction in H.263. 


H.261 8x8 predicted Block with 1-2-1 Filter 


PT EPEEEE] fe 
fs Jo |i] 12 [13 | 14 fs | (1) 1/16 242 for pixels inside the block 


(2) 1/16 |484]| for pixels on the block edge 
(3) 1/16 | 16 | for pixels on the block corner positions 
ise [sr [ss ]e0 | co for [oo | os 
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H.261 1-2-1 Filter Table 


121 121 121 121 121 121 
242 242 242 242 242 242 
121 121 121 121 121 121 
121 121 121 121 121 121 
242 242 242 242 242 242 
121 121 121 121 121 121 
121 121 121 121 121 121 
242 242 242 242 242 242 
121 121 121 121 121 121 
121 121 121 121 121 121 
242 242 242 242 242 242 
121 121 121 121 121 121 
121 121 121 121 121 121 
242 242 242 242 242 242 
121 121 121 121 121 121 
121 121 121 121 121 121 


242 242 242 242 242 242 


eae 121 121 121 121 aa 


rr rr 


Block Diagram 


The prediction picture is transferred into Filter Engine by DMA operation. Half Pixel 
PreProcessor module performs the half pixel interpolation in x-axial direction. HPP module 
performs the half pixel interpolation in y-axial direction. OMC module performs PB frame 
prediction. The control flow is specified in FCRO (Filter Control Register #0) and BRR (Bi- 
direction Range Register) registers. 
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Input from DRAM Controller 


Half Pixel 
PreProcessor 


\ 


To DCT/IDCT Engine 
or DRAM Controller 


Filter Registers 


RISC Address / PCI Offset Address Read/Write PO 


018CH / 0630H FCRO Filter Control Register #0 
018DH / 0634H FCRI1 Filter Control Register #1 
018EH / 0638H Bi-direction Range Register 
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3.14 FIDCT/Q/IQ Engine 


W9960CF contains Forward/Inverse Discrete Cosine Transform Engine (FIDCT) and 
Quantization /Inverse Quantization Engine, which are frequently used in video 
compression/decompression. 


For video encoding, DCT is used to reduce spatial redundancy of images. The quantization logic 
further filters most AC values and keeps the DC value. 


In video decoding, each 8x8 block is transformed back by Inverse Quantization. This |Q process 
induces quantization error in AC values compared with original data, but the DC value is 
recovered losslessly. Thereafter, data is sent to Inverse Discrete Cosine Transform stage to do 
image recovery process. In the INTRA mode, the result is final reconstructed image. While in 
the INTER mode, the result has to be added with the previous block data to reconstruct the 
image. 


Block Diagram 


From Filter 


RE-Construct 


BUFFER | 
I 


Current Block 


ca 
2) 


IDCT 


Zig/Zag 


From VLD BUFFER 
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DCT/IDCT Register 


RISC address/PCl Offset Address I= “Nariee- I Read/Write 


0180H/0600H Q_Control | pw [Quantization value selection 


0182H/0608H CBP threshold | pw |cpp threshold value 


0183H/060CH CheckSum DCT result checksum 


0184H/0610H EQuant value | BAW [Encoding loop Quant value 


0185H/0614H Decoding loop Quant value 


0186H/0618H EBD Quant value RAW - [Encoding loop BQ/DQ value 
0187H/061CH DBD Quant value | Rw [Decoding loop BQ/DQ value 
R/W 


| Rw | Intra/inter mode, coding/decoding 


0188H/0620H CODEC 
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3.15 Programmable Input/Output Engine 
Programmable input/output (PIO) engine can control the input bit stream data from X_bus or 
CPU_data_bus to the variable length decoder (VLD). PIO plays the role of interface between the 


frame memory (FM) and X_bus during bitstream receiving for decoding and bitstream 
transmission after encoding. 


Block Diagram 


FIFO 


DECODER i\ 
FIFO 


OUTPUT 
DATA 
FIFO 
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PIO Registers 


| orapHiosacH | xomar | RW | x_Bus DMA RequestEnable Register 


orc ossoH | roar | RW __| Frame Memory DMA Request Enable Register 
orpn/ossa | rc |W | Frame Code Register 
ome oessH | po |W __| Lock Position Detect Register 
__otarH/osecH | torsr | RW __| inputiOutput FIFO Status Register 
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3.16 Variable Length Code Decoder 


The Variable Length (VL) Engine uses the code books or lookup tables to map the Huffman code 
words. Huffman coding is a kind of entropy coding method used in H26X. Image or video data 
size can be reduced significantly when proper Huffman tables are used. Not only the variable 
length codes but also the fixed length codes can be decoded in this engine. Bit streams coming 
from the programmable in/out port (PIO) are fed into VL engine during decoding process. 


VLD Register 


| ores /o6coH | MCRL | RW __| Match Code Low Word Register _ 

| onsen /osea | oR H |W __| Match Code High Word Register _ 
01BAH / O6E8H / ousr | Rw Match Size egister 

a ogee ee eee 

| orpcH/osroH | ures | oR | i ResutRegister 


01BDH / O6F4H VLRES_LW | oR VL Result Low Word Register 


RISC Address / PCI Offset Address Read/Write 
01C0OH / 0700H VLDTCO VLD TCoefficient Register 0 


01FFH / 07FCH VLDTC63 VLD Tcoefficient Register 63 
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3.17 Audio Coprocessor Interface 


W9960CF provides an ADSP-21xx SPORT compatible serial port to interface to audio 
coprocessors. Multi-channel mode in AD SPORTO is provided for interfacing with other chips 
time-division multiplexing (TDM) function. Transmitter FIFO and Receiver FIFO are used since 
serial one-bit bit stream is packed into 32-bit word, and no CPU instructions to access Tx or Rx 
registers. This T/R FIFO should access with frame memory via 32-bit X_bus. No compounding 
function is provided with, since no management for u-law or A-law data format. 


Block Diagram 


Serial 
Control 


Transmit 
Shift Reg 


AUDIO PORT Register 


i 
/W 


Transmit Word Enables for Low Word Register 


019DH / 0674H TWE_L 


019EH / 0678H TWE_H Transmit Word Enables for High Word 
Register 


019FH / 067CH TRBDR | oR | Transmit/Receive Buffer Detect Register 
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3.18 Variable Length Code Encoding (VLE) Engine 


There are two modules in VLE Engine, one is for Tcoeff (Transform coefficient) encoding and the 
other is for variable length encoding. The Tcoeff is based on the run level pair generated from 
the result of DCT/Quantization and pushed into the 16x32-bit Tcoeff_FIFO. When DCT/Q is 
completed, VRISC shall check some conditions such as checksum register and determine 
whether to trigger RLC ( Run Level Code ) to generate transform coefficient. When all blocks is 
encoded into Tcoeff_FIFO, VRISC shall read Bit Length Register to check how many bits are 
generated (overflow condition if Bit Length Register is over 512, busy condition if Bit Length 
Register is Zero). Then VRISC read out the Tcoeff value from Tcoeff Result Register. VLE_INT 
will assert when the coding bit length is over the threshold of Tcoff_FIFO. VRISC can also write a 
command including data and table-indicator into Command Register to generate variable length 
code ( except Tcoeff ). 


Block Diagram 


H261/H263/MPEG/JPEG 


DCT_Trig 
DCT_rdy 


VLE Command Register 


VLC Table 


Tcoeff-VLE 
16 x 32bit 
FIFO 
VLE Result Register Bit Length Register 


Tcoeff Result Register 


DCT/QUANT 


LC RLC_Trig 


( Tcoeff FIFO overflow ) 
VLE_INT 


VLE Register 


|_| BitLength Register 
|__| Tooett Result Register | 
| R_ | VieResut Register 
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3.19 ISA-like Interface 


The ISA-like Interface implements a parallel I/O port to connect to co-processors, and a Boot- 
ROM mechanism to download VRISC firmware automatically. 


This ISA-like parallel I/O port provides an 8-bit data bus and 16-bit address bus to connect to co- 
processor easily. The host can access the connected co-processor through W9960CF PCI 
interface and this ISA-like interface. In this sense, W9960CF acts as a PCI to ISA bridge. It also 
provides two external interrupt requests of level or edge trigger. 


This ISA-like Interface provides another path to download VRISC firmware in addition to the PCI- 
bus interface. In Boot-ROM mode, the ISA-like interface will issue Boot-ROM access cycle to 
download firmware after W9960CF is reset. 


W 9960CF 


PCI ISA-Like 


Interface Interface 


[Internal Bus 
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ISA-Like Interface Resigisters 


RISC Address/PClI Offset Address Read/Write 
020H/080H iSACTL__| RAW _| Setting ISA control operation register _ 
021H/084H SAADDR | Rw Setting ISA output address register 


R/W 
R/W 


Programming Sequence 


W9960CF assert I/O read/write cycles on ISA-like interface accroding to the following 
programming sequence. 


IO Write Cycle IO Read Cycle 
Store ISACTL register Store ISACTL register 
(set WE bit enable) (set RE bit enable) 
Store ISAADDR register Store ISAADDR register 
(set effective address) (set effective address) 


Store ISADATA register Insert Delay 

(set effective data) (7-14 clock cycles) 
Load ISADATA register 
(read effective data) 
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4. W9960CF REGISTERS 


4.1 PCI Configuration Registers 


ce 


Class Code Revision ID 


a Base Address Register 0 (BARO) 
Base Address Register 1 (BAR1) 


18H - 38H Reserved 


Device/Vendor ID Register 


Read-only PCI Configuration Address: 00H 


Default: 9960 1050H 


_31_| 30 | 29 | 26 | 27 | 26 | 25 | 24 | 2 | 22 | 2 | 20 | 19 | 18 | 17 | 16 | 
15 | 4] a3 | {| wo} o |e} 7 {eo} sj 4} a] 2}ialo| 


Bits 31-16 Device ID 
Device ID allocated for the W9960 is 9960H. 


Bits 15-0 Vendor ID 


Vendor ID is allocated by the PCI SIG to ensure uniqueness. The value assigned to 
Winbond Electronic Corp. is 1050H. 
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Status/Command Register 


Read/Write PCI Configuration Address: 04H 


Default: 0280 O0000H 


_31_| 30 | 20 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 2 | 20 | 19 | 18 | 17 | 16 | 


stu]ele[ulw[s[e[7[*]s]*]s]2]4]o 
Fesevea __——=Ss=~S~SC*«*dBE (SEE [PEE |Reconea ‘BENGE 


Bit 31 DPE ( this bit will be clear to 0 , if the write data in the corresponding bit location is a 1 ) 
Detected Parity Error 
1 = Parity Error 
0 = Parity Correct 

Bit 30 SSE ( this bit will be clear to 0 , if the write data in the corresponding bit location is a 1 ) 
Signaled System Error 
1 = System Error 


0 = System Correct 


Bit 29 RMA ( this bit will be clear to 0 , if the write data in the corresponding bit location is a 1 ) 
Received Master Abort 
1 = Master Abort On 


0 = Off 


Bit 28 RTA ( this bit will be clear to 0 , if the write data in the corresponding bit location is a 1 ) 
Received Target Abort 
1 = Target Abort On 


0 = Off 


Bit 27 STA (this bit will be clear to 0 , if the write data in the corresponding bit location is a 1 ) 
Signaled Target Abort 
1 = Target Abort On 


0 = Off 
Bits 26-25 DEVSEL Timing ( Read only ) 
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01 = Medium DEVSEL# timing 


Bit 24 MDP ( this bit will be clear to 0 , if the write data in the corresponding bit location is a 1 ) 
Master Data Parity Error Detected 
1 = Detect Parity Error 
0 = Parity Correct 
Bit 23 FBC 
Fast Back-to-Back Capable ( Read only ) 


1 = Support Fast Back-to-Back Capable 


Bits 22-17 Reserved 
Bit 16 CLK 


PCI CLK Support ( Read only ) 
0 = 33Mhz 


Bits 15-10 Reserved 


Bit 9 FBE 
Fast Back-to-Back Enable 


1 = Enable 
0 = Disable 
Bit 8 SERR# Enable 
1 = Enable 
0 = Disable 
Bit 7 Reserved 


Bit 6 PEE 
Parity Error Response Enable 


1 = Enable 
0 = Disable 


Bits 5-3 Reserved 


Bit 2 BME : Bus Master Enable 
1 = Enable 
0 = Disable 

Bit 1 MSE 
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Memory Space Enable 

1 = Enable the device response to memory space accesses 

0 = Disable the device response to memory space accesses 
Bit 0 Reserved 


Class Code/Revision ID Register 


Read-only PCI Configuration Address: 08H 


Default: 0400 0000H 


31 | 30 | 20 | 28 | 27 | 26 | 25 | 24 | 28 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 


Base Class Code Sub-Class Code 


stu [ele[n]wle]e|7l[*l[s]*]s]2] 1] 0] 


Bits 31-24 Base Class Code 

Bits 23-16 Sub-Class Code 

Bits 15-8 Programming Interface 
Bits 7-0 Revision ID 


Bits 31-8 are hardwired to 040000H to specify that the W9960 is a multimedia device. 


Header Type / Latency Timer Register 


Read-Write PCI Configuration Address: OCH 


Default: 0000 FFOOH 


rs[u[e[e@[ulols[e]7l[s|[s|[*]]*|+] 0 
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Bits 31-24 Reserved 
Bits 23-16 Header Type ( Read only ) 
OOH = Type 00H 
Bits 15-8 Latency Timer for Master 
the low-order 3 bits are read only, high-order 5 bits are read-writable 


Bits 7-0 Reserved 


Base Address 0 Register 


Read/Write PCI Configuration Address: 10H 


Default: 0000 O000H 


_31_| 30 | 29 | 28 | 27 | 26 | 25 | 24 | 2 | 22 | 2 | 20 | 19 | 18 | 17 | 16 | 


Base Address 0 


15 | 14] a3 | {jo} o |e} 7} eo} si 4] a] 2}ialo| 


Base Address 0 


Bits 31-0 Base Address 0 


This field can be used to relocate memory address space to any location that is aligned to 
4 Kbytes for mapping W9960 Engine Registers and Data Memory. 


The low-order 12bits are read only and hardwired to OOOH, the high-order 20bits are read- 
writable. 


Base Address 1 Register 


Read/Write PCI Configuration Address: 14H 


Default: 0000 O000H 


_31_| 30 | 29 | 26 | 27 | 26 | 25 | 24 | 28 | 22 | 2 | 20 | 19 | 18 | 17 | 16 | 


Base Address 1 


p15 | a fas} {| w]e | et 7 ie] sia] al] 2}ilo | 
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Base Address 1 


Bits 31-0 Base Address 1 


This field can be used to relocate memory address space to any location that is aligned to 
4 Mbytes for mapping W9960 DRAM Frame Memory. The low-order 22bits are read only 
and hardwired to 0_O000H, the high-order 10bits are read-writable. 


Interrupt Line Register 


Read/Write PCI Configuration Address: 3CH 


Default: OOH 


EARS VEE aes 


Interrupt Line 


Bits 7-0 Interrupt Line 


This 8-bit register is used to communicate interrupt line routing information. POST software 
will write the routing information into this register as it initializes and configures the system. 


Interrupt Pin Register 


Read-only PCI Configuration Address: 3DH 


Default: 01H 


p7 je | sp et ep eta fo | 


Interrupt Pin 


Bits 7-0 —_ Interrupt Line (hardwired to 01H) 


This register is hardwired to 1 to specify that INTA# is the interrupt pin used. 
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5. ELECTRICAL SPECIFICATIONS 


5.1 Absolute Maximum Ratings 


Ambient temperature 0°C to 70°C 


Storage temperature -40°C to 125°C 
DC supply voltage -0.5V to 5V 
I/O pin voltage with respect to Vss -0.5V to VDD+0.5V 


5.2 DC Specifications 


(Vpp = 3.0V to 3.6V, Vss = OV, Vppsv = 5V, TA=0° C to 70° C) 


[symbol | Parameter | Min | Typ | Max | Unit | Conditions _| 
| vy |inputtowvotage | | | os | v | 
Input High Voltage | 


Input High Voltage 


Vit 
Vin ; 
| Vo. foutputtow Votage |_| 
| Vow JoutputHighvotage | 24 | | 


V lonp=6MA 
lie Input Low Current A 


——_fontitor cure ua 


liq LA |_| 
wif} 
| Ow input Capacitance | | | to | pe | 
| Cour Output Capacitance | | | so | pp | 
icc |Power Supply Current_| | 4oof | ma | 


5.3 AC Specifications 
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5.3.1. CLOCK SPECIFICATION 


3.3 Volt Clock 


5.3.2 RESET TIMING 


Symbol Parameter 
System reset active pulse width 
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5.3.3. PCI INTERFACE AC TIMING 


OUTPUT 0.285VDD SiSINE edge) 
DELAY i 


Tri_State 
OUTPUT 


symbol | Parameter | Min | Max | Units | Notes _| 
[tu.__|ouK Stal vad Oetey-bused sina} 2 f tif rs {| 1 
| twa(ptp)_ [CLK to Signal Valid Delay - point to point | 2 | 12 | 


Float to Active Delay 


Active to Float ee 


tsu(ptp) _|lnput Set Up Time to CLK - point to point 
aa Input Hold Time from CLK Lo | 


Note 1: bused signals : AD31-ADO0, C/BE3-C/BE0, PAR, FRAME#, TRDY#, IRDY#, STOP#, 
DEVSEL#, PERR#, 


Note 2: point to point signals : REQ# is output signal, GNT# is input signal 
Note 3: Input Signal : IDSEL 
Note 4 : Output Signal : INTA#, SERR# 
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5.3.4 AUDIO INTERFACE AC TIMING 


A_ 0.4VDD 


toch 


emer Wonvoe 


t 


fsd 


| 0.285VDD (rising edge) 
0.645VDD (falling edge) 
t 


Synbel 
SCLK Period 


ii] i 

ca-.- 
Ref. to SCLK falling 

i. 
Ref. to SCLK falling 

| tus __[TFS/RFS Valid Delay from SCLK rising 


trav DT Valid Delay from TFS 
Alternate Frame Mode) 
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5.3.5 DRAM INTERFACE AC TIMING 


CLK_MEM 7 
RASO#,RAS1# r y 


2 
CASO#,CAS1# 


MA7-MAO Row Addr olumn Addr Column Addr 


WE1#,WE0# 


MD31-MDO 
(output) 


OE1#,OE0# 


MD31-MDO 
(input) 


te _|RAS# valid delay ref to CLK_MEM rising _| 
| te [RASH valid delay re. to CLK MEM falling _| 


ty [Memory Data valid delay | 
| tugs [Memory Data setuptime | 
[| tn [Memory Dataholdtime | 


Symbol 
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5.3.6 GPIO AC TIMING 


CLK_MEM y 


GPIO3-GPIOO 
(Output) 


GPIO3-GPIOO 
(Input) 


Symbol | ___ Parameter | Min. | Max | Unit 


tac GPIO output access 15 ns 
time 


GPIO input setuptime | 10 | | ns 
| tuoi | GPIOinputhoidtime | 5 | ns 
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5.3.7 VIDEO PREPROCESSOR AC TIMING 


CLK_Video 


HREF, VS, HS, VD[15:0] 


symbol | Parameter | Min_| Typ | Max | Unit 
| tuc [CLK videoperiog | || ns 
—tas.} cs viseoton ian [| __{_st_}__{_rw__ 
tac | CLK Video High wietn | 


tuotp Input Hold Time 
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5.3.8 ISA-LIkE Bus AC TIMING 


Waveform of IO Write Operation 


SAIOW# 


SAADDAIS-0 [Address 
eS 
SADATAT-0 [baa 


Waveform of |O Read Operation 


SAIOR# 
| Address 


SAADDR15-0 Fanece 


SADATA7-0 Data 
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6. APPENDIXES 
6.1 Porting Guide for W9960 Win95 Device Driver 


6.2 Firmware Loading Precedure 


6.3 Application/Firmware Command Block 
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